Modelling pitch accent types for Polish speech synthesis
Authors
Abstract
We describe a Polish prosody modelling module for the Festival speech synthesis system. The module uses classification and regression trees for accent type prediction and linear regression for generating the F0 contours of the predicted accents. We present the techniques used in an attempt to overcome problems with the only data available. We show how improvements were achieved through a modified F0 stylisation, accent type clustering and language-specific features. Results of a formal perception study show a significant preference for the new intonation model over the original one.
Similar resources
Comparative investigation of peak alignment in Polish and German unit selection corpora
This paper presents a comparative study on the temporal alignment of pitch peaks of H*L accents in Polish and German. Speech material used in the study came from the unit selection synthesis corpora of the Polish voice module of the BOSS system and the IMS German Festival TTS system. The major factors investigated were concerned with the influence of syllable structure on the one hand, as well ...
F0 contour and segmental duration modeling using prosodic features
This paper proposes a framework of F0 contour generation and segmental duration modeling for application in a unit-selection speech synthesis system for Polish – BOSS. We describe the design of the F0 and duration modeling modules and emphasize the role of prosodic features (related to stress, pitch accent and phrase) in these two tasks.
Universal and Language-specific English and P
We compared nuclear accent production in English and Polish read speech. We investigated declaratives and three types of questions. We expected to find (a) cross-linguistic differences and (b) a cross-language generalisation which may be evidence for an intonational universal. The generalisation under investigation was a trade-off between syntactic or lexical question markers in the text and th...
Prosody annotation for corpus based speech synthesis
The paper concerns prosody annotation, especially for application in corpus-based speech synthesis. In order to establish the rules of automatic intonation modelling, a phonetically labeled speech database of 4 hours has been perceptually and acoustically analyzed. The speech material included different text types and prosodically rich phrases. The annotation of the speech database consists in p...
Modelling prominence and emphasis improves unit-selection synthesis
We describe the results of large scale perception experiments showing improvements in synthesising two distinct kinds of prominence: standard pitch-accent and strong emphatic accents. Previously prominence assignment has been mainly evaluated by computing accuracy on a prominence-labelled test set. By contrast we integrated an automatic pitch-accent classifier into the unit selection target cos...
Publication date: 2005